High Throughput Queue Algorithms

نویسندگان

  • Daniel Orozco
  • Elkin Garcia
  • Rishi Khan
  • Kelly Livingston
  • Guang Gao
چکیده

Advanced many-core CPU chips already have few hundreds of processing cores (e.g. 160 cores in an IBM Cyclops-64 chip) and as computer architecture progresses, more and more processing cores become available. The underlying runtime systems of such architectures need to efficiently serve hundreds of processors at the same time, requiring all basic data structures within the runtime to maintain unprecedented throughput. In this paper, the design of concurrent queues is explored to meet the runtime system and algorithm demands for hundreds of simultaneous transactions being handled in real time. We explore some of the basic standard queueing techniques, showing their weaknesses in a highly parallel environment. We then develop a simple high throughput queue. The algorithm is then extended to address all cases and to provide all the functionality commonly found in queues. We test all queueing algorithms on Cyclops-64, a many-core processor by IBM. The following are the major contributions of this paper: 1. Queueing theory is used to provide a mathematical background for the inherent throughput of queueing algorithms. 2. The Circular Buffer Queue algorithm: A very high throughput, low latency queue implementation useful for most common cases. 3. The High Throughput Queue algorithm: A high throughput, low latency queue implementation that supports all traditional operations on a queue. 4. Experimental results that show that for highly parallel systems, both the Circular Buffer Queue and the High Throughput Queue allow larger scalability and higher performance than other famous, state of the art implementations. 5. We show that the queue algorithms presented are linearizable and behave like nonblocking, wait-free algorithms for practical implementations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improvement over Random Early Detection Algorithm: A Self-Tuning Approach

Random Early Detection (RED) is one of the most commonly used Active Queue Management (AQM) algorithms that is recommended by IETF for deployment in the network. Although RED provides low average queuing delay and high throughput at the same time, but effectiveness of RED is highly sensitive to the RED parameters setting. As network condition varies largely, setting RED's parameters with fixed ...

متن کامل

Congestion Control Using Predictive Approach in Mobile Ad Hoc Network

The Active Queue Management algorithms stabilize the instantaneous queue length through mapping the congestion measurement into packet drop probability to achieve high throughput and low average delay. Random Early Detection [RED] is widely used AQM mechanism detecting and avoiding the incipient congestion. The detection of congestion is based on priori estimation of congestion and calculation ...

متن کامل

Optimal Scheduling Algorithm for Throughput Maximization in Multihop Wireless Networks

In this paper, focus on designing a scheduling scheme for achieve maximum throughput. Although, it does not require per-flow or perdestination information and also, this paper consider the problem of link scheduling in Multihop Wireless Networks under general interference constraints. The main goal is to achieve maximum throughput and better delay performance at low complexity. Previously, we u...

متن کامل

Performance Evaluation of Aqm Algorithms for Pgm Based Group Communication in Dvmrp Multicasting Network

Queue management schemes at the routers and congestion avoidance schemes at the end points cooperate to provide good congestion solutions in computer networks. While queue management schemes are still being developed, research on congestion avoidance has come a long way to serve the bandwidth requirement of the networks. Because of considerable lack on the evaluation research work, there is no ...

متن کامل

RIPQ: Advanced Photo Caching on Flash for Facebook

Facebook uses flash devices extensively in its photocaching stack. The key design challenge for an efficient photo cache on flash at Facebook is its workload: many small random writes are generated by inserting cachemissed content, or updating cache-hit content for advanced caching algorithms. The Flash Translation Layer on flash devices performs poorly with such a workload, lowering throughput...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011